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(57) Abstract 



A firewall (3) for controlling network data packet traffic between 
internal and external networks (I. 5, 4), comprising filtering means selecting 
from a total set of rules, in dependence of the contents in data fields of a 
data packet being transmitted between said networks, a rule applicable to the 
data packet, in order to block said packet or forward said packet through 
the firewall (3). A 2-dimensional address lookup means (8) performs a 
2-dimensionaI lookup of the source and destination addresses of the packet 
in a set of address prefixes, each prefix having a subset of rules of the total 
set of rules, in order to find a prefix, via its representation, associated with 
said source and destination addresses, and rule matching means (10) for 
rule matching, on the basis of the contents of said data fields, in order to 
find the rule applicable to the data packet. 
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TITLE: FIREWALL APPARATUS AND METHOD OF CONTROLLING NETWORK 
DATA PACKET TRAFFIC BETWEEN INTERNAL AND EXTERNAL NETWORKS 

5 ' 



Field of the Invention 

The present invention relates generally to a firewall 
10 apparatus and a method of controlling network data packet 
traffic between internal and external networks, and more 
particularly to a firewall apparatus comprising filtering 
means for selecting from a total set of rules, depending on 
the contents in data fields of a data packet to be 
15 transmitted between said networks, a rule applicable to the 
data packet, in order to block said packet or forwarded the 
packet through the firewall, and a method thereof. 

Description of the Prior Art 

20 An important issue for most Internet connected 

organisations is security and consequently firewalls are 
becoming an important part in most computer and network 
security strategies in most organisations. Users accessing 
the webserver or other public services of the organisation 
25 must not be able to gain access to internal services such 
as accounting systems, Internet information servers and 
other possibly sensitive company information. The service 
of the systems must not be interrupted - servers and work- 
stations need to be protected against denial-of-service 
30 (DOS) tags from users on the Internet. 

A firewall, or filtering router, is a device that 
works basically the same way as a router. That is, it 
receives packets on an in-interface, inspects the packets 
destination address, and forwards the packet-on the correct 
35 (with respec" to the destination address) out-interface. 
However, a firewall performs a much more thorough 
inspection zi each packet. The source and destination 
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address, source and destination ports, protocol field, 
flags, and options are also inspected and compared to a 
list of firewall rules. Depending on which rule matches the 
packet, the firewall might decide, not to forward the 
packet, for instance if a blocking rule is matched. 

In addition to unauthorized access there are other 
threats that arise when an organisation is connected to the 
Internet. The bottom line is that data received from 
unknown sources cannot be trusted. Scanning for viruses and 
trojan horses in email and webpages are duties performed by 
some prior art firewalls. 

Further, as network bandwidth is increasing, the 
performance of the firewalls are becoming an important 
issue . 

Firewalls can work on many different levels and 
provide different kind of functionality for scanning data 
passing it. However, the basic functionality of all fire- 
walls is to implement filtering based' on the contents of 
the network (IP=Internet Protocol) and transport (UDP, 
TCP=Transmission Control Protocol and ICMP=Internet 
Control Message Protocol) layer headers. Without such IP 
filtering all other functionality, such as data scanning, 
is useless, that is users on the internal network might 
just as well configure their network applications not to go 
through the scanner to connect to remote servers and thus 
bypass all security functionality. 

Companies or organisations are connected to the 
Internet for different reasons, for example in order to 
publish information about a company, its products and 
services on the web, get access to information available on 
the Internet, and correspond via email. 

The company often has internal information that users 
on the Internet must not be able to access, such as 
Internet information servers, file servers etc. The most 
common configuration is to allow connections from the 
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Internet to a set of servers (web, email, and other public 
services), but to deny access to other hosts (for example 
intranet servers). To achieve this a "demilitarised zone" 
(DMZ) is established. Connections to computers in the DMZ 
can be made from the Internet as well as from the intranet, 
but access to the intranet from the Internet is restricted. 
In prior art networks an internal network, such as an 
intranet is connected to the demilitarised zone via a 
firewall and the DMZ is connected to the Internet via a 
router. Consequently, network traffic can pass freely 
between the Internet and the DMZ, which is completely 
unprotected from users on the intranet. A reason for this 
is that prior art firewalls also lack the possibility to 
connect more than two networks - an internal and an 
external network. 

Other firewalls have three network interfaces. Here, 
restrictions can be made concerning traffic between the 
Internet and the DMZ as well as the intranet. Some restric- 
tions are made for traffic to and from hosts in the DMZ, 
for example the web server only needs to be accessible on 
the HTTP (Hypertext Transfer Protocol) port. Internet users 
should not be able to connect to any other services. 
However, users on the intranet might want to be able to 
access the web server in more ways than the Internet users 
for administrative purposes, thus more access should be 
granted in between these two networks. Similar rules are 
needed for the email server; SMTP (Simple Mail Transfer 
Protocol) connections should be allowed from the Internet, 
but reading email should only be possible for certain 
allowed hosts on the intranet, and possibly also from some 
host on the Internet. 

In a firewall environment the number of machines in 
the DMZ is for example 30. The rules for the machines in 
the DMZ can be different for each machine, but the number 
of rules per machine is fairly low, for example 10-15. More 
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rules might apply for traffic from the intranet to the DMZ, 
but these are likely to be more general. Thus, a fairly low 
number of rules are valid for all machines in the DMZ. - 

Further, rules regarding traffic between the Internet 
and the intranet (s) are in most cases few, if any at all. 
Most traffic should be blocked. However, traffic initiated 
from the intranet might be allowed. 

As the number of users on the Internet grows, the 
public servers will be visited more frequently, causing 
more traffic. The traffic to and from the intranet 
increases as the intranet users are taking part of the 
increasing amounts of information available on" the 
Internet. Consequently, bandwidth requirements is 
increasing. This puts greater demands on the performance of 
the firewalls used. 

Thus, the main task for a firewall is packet 
filtering, that is given an IP packet and a set of rules, 
which rule should be applied on this packet? If several 
rules match the same packet a policy needs to be defined to 
specify which rule to pick. There are two prior art 
solutions known to this problem. One solution is to pick 
the rule matching the most number of fields of a packet, 
and if two rules match the same number of fields, but 
different ones, an order needs to be specified between 
them. This is used in the packet classification algorithm 
by Borg and Fiodin, Borg, N. Flodin, Malin, packet 
classification, June 1997; Borg, N., A Packet Classifier 
for IP Networks, Masters Lie, Lulea University of 
Technology, February 1998. Another solution is to define an 
order betweer. the rules and use that order to define which 
rule to pick. An advantage of the second solution is that 
it gives ber.ter flexibility when defining filter rules, and 
the NetBSD firewall code utilise this method. 

A fi-T-r rule comprises a set of criteria that has to 
be fulfill?-., and an action to perform when they are 



WO 00/02114 



5 



PCT/SE99/01202 



fulfilled. The criteria are based on IP source and 
destination addresses (32-bit prefixes), IP protocol field 
(8 bit-integer), whether or not the packet has IP options 
set, and what these options are (integer) due IP/TCP source 
and' destination port numbers (2 16-bit integer ranges), TCP 
header flags (3 bits), ICMP header type and code fields (2 
8-bit integers), what interface the packet was read from (8 
+8 bits), and what interface the packet is to be forwarded 

to (8 + 8 bits) . 

Most firewalls today do not address the rule matching 
problem in particular. It is common to have a linked list 
(or an array) of rules, comparing the packet with each and 
every one of these until a match is found. However, this is 
not efficient. Another approach is hashing of the rules. 
Further, if the method for resolving ambiguities among the 
rules, that is two rules match the same packet, most 
implementations solve the problem by defining the first or 
last matching rule as the one to follow. 

A prior art firewall, PIX firewall by Cisco Systems, 
is a connection oriented security device that protects an 
internal network from an external network. The PIX firewall 
is a very expensive device and it has an upper limit of 
about 16000 simultaneous connections. The main part of the 
PIX firewall is a protection scheme based on the adaptive 
security algorithm (ASA) , which offers stateful connection 
oriented security. ASA tracks the source and destination 
address, TCP sequence numbers, port numbers, and additional 
TCP flags of each packet. This information is stored in a 
table, and all inbound and outbound packets are compared 
against entries in the table .. Hence, information of each 
connection established has to be stored during the lifetime 
of the connection, and thus, the number of connections 
possible are defined by the memory capacity available. A 
fully loaded Cisco PIX firewall can operate at about 90 
Mbite/s. However, the Cisco PIX firewall also supports pert 
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address translation (PAT), whereby more than 64000 internal 
hosts can be served by a single external IP address. 

A prior art packet filter called ipf (IP filter) is 
included with the standard distribution of net BSD 1.3. 

5 The rule sets in ipf are split up on the interfaces 

on which they are valid. Furthermore, the rules are checked 
twice, first when the packet enters the host and second 
when it leaves the host. Rules only valid for inbound 
packets are not added to the list of rules checked at the 

10 output port, and vice versa. The data structure is 
basically an optimised linked list. . 

The Exokernel, Engler, D., Kaashoek, M. F., O'Tool 
Jr, J., Exokernel: An operating system architecture..., 
Proceedings of the 15th ACM symposium on Operating Systems 

15 principles, December 1995, uses a different approach to 
handle packet demultiplexing called DPF, Angler, D., 
Kaashoek, M. F., DPF: Fast, flexible message 
demultiplexing..., Engler, D., Kaashoek, M. F., Computer 
Communication Review, Vo. 26, No. 4, October 1996. The 

20 rules are written in a special programming language, and 
thereafter, the are compiled. The compiler knows about all 
the rules specified, the generated code can be optimised 
for the expected traffic patents. 

25 Summary of the Invention 

It is an objective of the present invention to 
provide an improved firewall apparatus and a method of 
controlling network traffic between internal and external 
networks providing an efficient address lookup and rule 

30 matching process in order to achieve an effective and fast 
IP packet filtering, and an unlimited number of possible 
connections through the firewall. 

This is accomplished by the firewall apparatus and 
method according to the invention, wherein the set of rules 

35 needed to be searched linearly is reduced by segmenting the 
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rule set. The firewall according to the invention comprises 
2-dimensional address lockup means performing a two step 
lookup, first of source and destination addresses of the 
packet in a set of address prefixes. Each prefix is 
associated with a subset of rules of a total set of rules. 
A liner search is performed on the resulting subset of 
rules in order to find the rule applicable to the present 
data packet . 

Another object of the invention is to provide a 
fragment machine enabling filtering of all fragments in a 
fragmented packet. 

Still another object of the invention is to provide 
network address translation means translating internal 
source addresses to external source addresses of a packet 
transmitted from the firewall or external source addresses 
to internal source addresses of a packet transmitted into 
the firewall. 

Another further object of the invention is to provide 
network address translation means translating internal 
source addresses to external source addresses of a packet 
transmitted from an internal network to an external 
network, or external source addresses to internal source 
addresses of a packet transmitted from the external network 
to the internal network. 

Still another object of the invention is to provide 
hole punching means performing a temporary exception from 
an external-to-internal blocking rule for a connection 
initiated from the internal network, wherein a returned 
channel for packets transmitted from the external network 
to the internal network are established through the 
firewall. 

A further object of the invention is to provide a 
firewall capable of handling at least 1000 unique rules. 

Advantageous of the firewall and the method thereof 
according to the present invention are the unlimited number 
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of possible simultaneous connections, the fast IP 
filtering, and the great number of possible rules 
supported . 

Another object of the firewall according to the 
invention is to provide a firewall comprising, a router. 

Brief Description of the Drawings 

In order to explain the invention in more detail and 
the advantages and features of the invention preferred 
embodiments will be described in detail below, reference 
being made zz the accompanying drawings, in which 

FIG 1 is shows common network topology comprising the 
firewall according to the invention, 

FIG 2 is a block diagram of the firewall according to 

15 the invention, 

FIG 3 is an illustrative view of a partition of a two 

dimensional dense chunk, 

FIG 4 is an illustrative view of the data structure 

according tz the invention, 

FIG £ is an illustrative view of a class (0,0) tile, 

FIG 6 is an illustrative view of a class (1,1) tue, 

FIG ' is an illustrative view of a class (1,2) tue, 

FIG S is an illustrative view of a class (2,1) tile, 

FIG 5 is an illustrative view of a class (1,3+) tile, 
25 pig i: is an illustrative view of a class (3+,l) 
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tile, 
tile, 



FIG 11 is an illustrative view of a class (2+,2+ 



LX 



FIG LI. shows an example of an unsuccessful search fo* 
a particular query key in a Patricia Tree containing si: 
keys, and 

FIG shows the Patricia Tree resulting from an 
insertion cf the query key from the unsuccessful search 
according . : FIG 12. 
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De-tailed Description of the Invention 

An example of a modern network topology from a 
company's or an organisation's point of view is shown in 
FIG 1. An internal network 1, such as an Intranet comprises 
5 several network nodes 2 such as PCs, workstations, file 

servers etc, which are connected to a firewall 3. Companies 
or organisations connected to an external network 4 ( 
Internet) intend to publish company related information, 
such as products and services, on the web, get access to 

10 information published by other companies or organisations 
on the Internet, and correspond via email. However, the 
company might have internal information that users on the 
Internet not are allowed to access, for example information 
available via the Intranet information servers, file 

15 servers etc. Thus, to allow Internet users to access public 
information they are allowed to be connected to a limited 
set of servers, for example the web, email etc., and denied 
to access information on other hosts, such as Intranet 
servers. The public servers are available in a 

20 "Demilitarised Zone" (DMZ) 5, which is connected to the 
firewall 3. Further, the firewall 3 is connected to the 
Internet via a router 6, and, hence, connections to nodes 
in the DMZ 5 can be made from the external network or 
Internet 4 as well as from the Intranet 1, but accesses to 

25 the Intranet 1 from the Internet 4 is restricted. 

In the following description, numerous specific 
details, are provided in detail in order to give a more 
thorough description of the present invention. It will be 
obvious for those skilled in the art that the present 

30 invention may be practiced without these specific details. 
Some well-known features are not described in detail so as 
not to make the present invention unclear. 

One embodiment of the firewall and the different 
modules in the fast path and how the filtered packets flows 

35 through according to the invention is shown in FIG 2. 



WO 00/02114 



10 



PCT/SE99/01202 



3 



In a simple case a packet is received from a network 
1, 4, or 5 in a firewall input connection 7 and is applied 
to the input of 2-dimensional address lookup means or a 2d- 
SFT block 8. A intermediate connection 9 connects the 2d- 
SFT and rule matching means cr block 10, wherein the packet 
is either passed (down) or blocked b5. However, in order to 
work properly the firewall according to the invention has a 
number of additional modules. 

In this embodiment a lookup of source address and 
destination address are performed in the 2d-SFT block 8, 
resulting in a rule or actually a short list of rules. The 
rule list remains in the rule matching block 10 until the 
list is searched and a matching rule is found. 
Additionally, information of whether the packet might need 
to be processed by the other modules or not are generated 
by the 2d-SFT lookup. Some of these decisions are taken 
during the rule matching which means that the rule matching 
actually starts before entering the block, as illustrated 
in FIG 2. The 2d-SFT block 8 is described in detail below. 

When a packet is too large to be sent over a link, it 
is fragmented. This means that everything that follows the 
I? header is cut into pieces (fragments) and each fragment 
is supplied with its own IP header. The additional 
fragments flag and the fragment offset is set in each 
fragment tc indicate if it is the last fragment or not, and 
to record where the data of the fragment fits into the 
original (unf ragmented) packet. 

When a packet is fragmented, only the first fragment, 
the fragment header, contains the transport header (TCP, 
UDP, or ICM? header) . This means that the following 
fragments can not be matched against a rule involving for 

example ports . 

According to the invention, a fragment machine 11 
collects fragments from each fragmented packet until the 
fragment header arrives (fragment does not necessarily 
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arrive in order) . Then, the pieces of information present- 
only in the fragment header are stored in the entry 
associated with that fragmented packet, and the collected ^ 
fragments are applied to the .output ol, connected to the 
connection 7, with the fragment header first. Each fragment 
that is transmitted from the fragment machine is supplied 
with the fragment header information, so that it can be 
processed by the filter just as if it was an unfragmented 
packet. The additional fragments flag and the fragment 
offset are checked to determine if the packet is applied to 
the input il - connected to the connection 7 - of the 
fragment machine 11 or not. 

When ail fragments of a fragmented packet has been 
received in the fragment machine 11, the entry for the 
packet is removed. 

At some points, the fragment machine might also 
decide to block fragments. This happens when broken 
fragmented packets arrives (possibly as a result of an 
attack) , if the number of collected fragments exceeds a 
certain limit, or simply as a result of garbage collection 
(old entries are removed to make place for new ones) . 

Network Address Translation (NAT) is commonly used 
when a company have an network with many internal IP 
addresses and only a few external (real) I? addresses. Some 
parts of I? address space are reserved for internal 
addresses, such as 10.*.*.*, 192.168.*.*, and 172.16.*.*. 
These addresses can freely be used on internal/private 
networks. However, they must never be visible on the 
external. Therefore, the firewall is setup to translate 
internal source addresses to external source addresses as - 
packet goes from the internal to an external network. For 
packets going in the other direction, the external 
destination address is translated to an internal address as 
the packets goes through the firewall. In order to map many 
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internal addresses onto a few external addresses, ports are 
also used. 

For example, the firewall is setup to map internal 
addresses from 10.1.0.0 to 10.1.255.255 (2 16 addresses) to 
external addresses 194.22.187.0 to 194.22.187.255 (2 8 
addresses) using ports 20000 to 20255 (2 8 ports) . 

When a connection is initiated from 10.1.1.1 port 
4000 to 130.240.64.46 port 6000, an address a and a port p, 
so that (a,p) does not collide with any other NAT 
connection, is picked from the address and port range. 
Then, each outgoing, internal to external (I2X) , packet 
from that connection, the source address 10.1.1.1 and port 
4000 are' replaced by a and p respectively. For each 
incoming, external to internal "(X2I) packet, the 
destination address a and port p are replaced by 10.1.1.1 
and 4000, respectively. 

In this way, the 256 external addresses together with 
the 256 ports can represent the 65536 addresses of the 
internal network. 

As a result from the 2d-SFT lookup, also information 
about if a packet is subject to an external to internal 
address translation is achieved, and the packer is applied 
on the input 12 of an X2I-NAT block 12 performing the 
external to internal address translation. Therefore, the 
overhead for performing X2I-NAT lookup is removed on all 
packets not requiring translation-. For packets where X2I- 
NAT lookup is performed, the packets' are sent to slow path 
means 13 via its slow path output s2 in the case of failure 
since updates of the NAT data structure are dealt with 
therein. When a successful X2I-NAT lookup is performed, the 
address and ports are changed and a rule matching of the 
new source-destination pair is retrieved before the packet 
is sent to the next filtering step via its output o2 . 

Also, as a result from the 2d-SFT lookup or from the 
X2I - NAT lookup, iz is clear if the packer, is subject to 
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internal to external (I2X) address translation. This is 
performed basically in the same way as X2I-NAT, but is 
performed as the last filtering step. A packet subject to 
internal, to external (I2X) address translation received 

5 from the output connection 15 of the rule matching block 10 
is applied cr. the input i'5 of an I2X-NAT block 14, 
performing the internal to external address translation. 
For packets where I2X-NAT lookup is performed, the. packets 
are sent to the slow path means 13 via its slow path output 

10 s5 in the case of failure since updates of the NAT data 

structure are dealt with therein. When a successful I2X-NAT 
lookup is performed, the address and ports are changed and 
the packet is transmitted to the appropriate network via 
its output o2 and the output connection 15. 

15 The reason for having X2I-NAT as the first step after 

2d-SFT lookup and I2X-NAT as the last step is that 
filtering rules are given with respect to internal 
addresses, which are fixed, and not NAT address, which are 
assigned dynamically. 

20 Usually, most of the "traffic that goes from an 

external network 4 to an internal network 1 is blocked, to 
protect the internal network. However, hosts on the 
internal network are usually allowed to access hosts on the 
external network 4. In order to receive any return traffic 

25 from the external, a temporary exception from the external- 
to-internal blocking rule must be- made for connections 
initiated from the internal network. This is referred to as 
hole punching (HP), i.e a hole for returning packets are 
punched through the firewall. The hole exists only during 

30 the lifetime of the connection, and does only affect 
packets frcr. the connection. 

Hole punching also keep track of the TCP sequence 
numbers in crder to protect hole punched connections from 
being hijacked. Therefore, it is necessary both to perform 

35 HP lookup cr. outbound (I2X)- packets performed by an I2X-HP 
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block 16 and inbound (X2I) packets performed by an X2I-HP 
block 17. 

As a result from the 2d-SFT lookup or from X2I-NAT 
lookup, we know if the packet is subject to internal to 

5 external (I2X) or external to internal (X2I) hole punching. 
This means that we can avoid the overhead from performing 
H? lookups cn packets that can not be subject to hole 
punching. An outbound packet subject to hole punching is 
applied to an input i3 of the I2X-HP block 16, whereby the 

10 source and destination addresses and ports, and the 

protocol, are looked up in order to find an existing state. 
If no such state exists, the packet is sent to the slow 
path means 13 via its slow path output s3, wherein the HP 
data structure is updated and a state is created. If a 

15 matching state is found, TCP-sequence numbers etc are 

update before the packet is sent to the next filtering step 
via another output o3 . 

The X2I-HP is performed in the same way. An inbound 
packet subject to hole punching is applied to an input i4 

20 of the X2I-K? block 17, whereby the source and destination 
addresses and ports, and the protocol, are looked up in 
order to find an existing state. If no such state exists, 
an attempt to send the packet through a non-existent hole 
in a blocking rule has been made and the packet is blocked 

25 at its output b4 . If a matching state is found, it is 
updated before the packet is sent, to the next filtering 
step via. another output o4 . 

Again referring to the 2d-SFT block 8, in dependence 
of the contents in data fields of a data packet being 

30 transmitted between said networks, a rule applicable to the 
data packet is selecting from .a total set of rules, whereby- 
said packet is blocked or forwarded through the firewall. 
In order to reduce the set of rules to be searched 
linearly, the rule set is segmented. According to the 

35 invention, this is performed by means of a 2 -dimensional 
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lookup of the source and destination addresses of the 
packet in a set of address prefixes, wherein each prefix 
has a subset of rules of the total set of rules, in order 
to find a prefix associated with the source and destination 
addresses. Then, based on the contents of said data fields, 
a rule matching is performed by the rule matching means 10 
in order to find the rule applicable to the data packet. 

When performing the 2-dimensional lookup of the 
addresses, each rule is seen as covering a rectangular area 
of a 2-dimensional plane, wherein the offset and size of 
the rectangle is determined by the address prefixes and 
prefix lengths. Hence, the lookup is considered to be the 
same problem as finding the rectangle surrounding a point 
in the plane. To simplify the lookup, a restriction is made 
to assure that each point in the plane is covered by one 
and only one rectangle, resulting in an easier lookup 
procedure . 

After the 2-dimensional address lookup is performed 
the lookup continues with a resulting subset of rules 
associated with the current prefix found. The address 
fields are, however, not used in the final rule matching. 
Thus, if a rule is not valid for the addresses of the 
current packet it is not in the list of rules resulting 
from the address lookup. 

Since each rule is represented by a rectangle 
covering a part of the total address space and several 
rules may be applicable to the same addresses, the 
rectangles may overlap. However, in order to make the 
method according to the invention to operate in the proper 
way overlapping rectangles are not allowed. Consequently, 
in order to fulfil the non-overlap criteria the following 
steps have to be performed: 

1. For each rule, create the rectangle in the address 

space. 
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2. Create a set containing only the newly created 
rectangle. This set will be called the compare set. 

3. For all rectangles already in the plane; compare 
it to each rectangle in the compare set. 

5 4. If ~hey are overlapping, cut out the non- 

overlapping parts. The rule list of the overlapping parts 
is assigned -he rule from the new rectangle appended at the 
end thereof. 

5. For all parts - if the part was a part of the 

10 rectangle already on the plane, return it to the plane. If 
not, add it to the set of rectangles to be compared. 

6. If -he compare set is none-empty, return to step 
3. Rectangles already in the plane and which have already 
been compared can be left out. 

15 7 . At this state the compare set is empty. If any 

rectangles were overlapping the new one they are split up 
into smaller parts if needed, with the common parts having 
rule lists containing the new rule. 

In anczher method to fulfil the non-overlap criteria 
20 there is no: just a set of rectangles in the plane. 
Instead, each rectangle contains, apart from its co- 
ordinate and rule list index, a set of rectangles or' 
subrectangles . Each of the subrectangles have an additional 
set of subrectangles. However, sometimes it is necessary to 
25 refer to the same subrectangle and to traverse a directed 
Acyclic graph (DAG) of rectangles depth. 

There is always one root rectangle covering the whole 
plane. This represents the default to follow if all other 
comparison fail. The rule action is either blocked or 
30 allowed to pass depending on the configuration. 

A rectangle called root is the root rectangle to 
which a rectangle new is to be added. 

If the root and the new rectangles are of the same 
size the rules in the new rectangle is added to the rule 
5- list associated with the root rectangle. 
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Iterate over all subrectangles of the root rectangle. 
If the new rectangle can be completely covered by any of 
these, make a recursive call with the subrectangle as the 
root instead and then return. 

Once again, iterate over all subrectangles in the 

root rectangle-. ■ 

If a subrectangle can be completely contained in the 
new rectangle, it is moved from the root rectangle to the 
new rectangle. The rule list of the subrectangle and all 
rectangles under it needs to be modified to include the 
rule of the new rectangle as well. 

If the subrectangle intersects with the new 
rectangle, a new rectangle is created comprising the common 
part of the two. The rule list of the intersecting 
rectangle is a combination of the original ones. Then, the 
new rectangle is added to both the original subrectangle 
and the new rectangle. 

Once .all rectangles are added to the DAG the graph 
can be traversed and the list of prefix-defined rectangles 
that is needed by the two dimensional lookup building code 
can be produced. The intersecting rectangle will be a 
proper prefix defined rectangle, but the rest of the 
surrounding rectangle after the subrectangles have been cut 
out may not be properly defined by prefixes. 

When zhe data structure is used for filtering lookups 
as describee above, the lookup is. made in two steps. First 
a two dimensional address lookup is performed, resulting in 
an integer number. This integer is an index into an array 
of rules, wherein each rule specifies which fields 'to 
compare and what action to perform if a match was found. 
Each rule has a next field indicating which rule to 
continue wi-h in case of a mismatch; The traversing of the 
rule list is continued until a match is found, and when 
proper acticr.s are taken in order to block or forward the 
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The 2-dimensional prefix problem is solved as 
follows. 

The address space or universe U is a 2 dimensional 
space consisting of integer pairs (s,d) satisfying: 
0 < s < 2 32 , 0 < d < 2 32 . 

A subset R of U satisfying: (s,d) e R if s 0 £ s < si, 
d 0 < d < di, wherein (s 0 ,d 0 ),( Si,di)€ U is called a 
rectangle. Further, the pair of points [(s 0 ,d 0 ),( Si,di)] 
uniquely defines R. 

A rectangle defined by [(s 0 ,d 0 ), { Si,di)], where Si-s 0 - 
Sl -2 is * k s = 2 is and di-d 0 = d x -2 id * k d = 2 id for some non 
negative integers i s ,i d ,k s/ and k d is called a prefix. 

Given a point (s,d) e U and a set of prefixes P = 
{Pi,P2f ».f Pn) / such that P is a partition of U, the 2 
dimensional prefix matching problem is the problem of 
computing i such that (s,d) e Pi. 

The source-destination part of the firewall filtering 
problem is represented as a 2-dimensional prefix matching 
problem, where the set P is obtained by converting the 
routing table and the filtering rules into a partition of 
prefixes. Since each packer to be filtered requires a 
prefix matching, it becomes necessary to find a 
representation of P such that the prefix matching can be 
computed efficiently. 

A number of prefixes that partitions a small 32 x 32 
bits universe is shown in FIG 3. Black squares 18 
represents bits set (representatives) and white squares 19 
repr esents not set bits. Note: point (0,0) is located in 
the upper left corner in FIG 3, 

For each prefix P = [(s 0 ,d 0 ), (si,di)] € P the point- 
Po=(so/do) is chosen as a representative of P. Further, let 
P= (Pi, pz, . • .,Pn) = ( (si,di) , (s 2 ,d 2 ) (s„,d n ) } denotes . 
the set of representatives of the prefixes in P. 
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Given a point (s d ,d d ) € U, for each (s,d) € U, such 
that s d > s and d d > d, (s d/ d d ) is a dominating point of 
(s,d), or alternatively, (s,d) is dominated by (s d ,d d ).^ 

Given a pair of points (si,di) , (s 2 ,d 2 ) e U, the 
distance between the points under the norm. is given by: 



Now, given a point p=(s,d), the problem of finding 
the matching prefix in P is equivalent to the problem of 
finding the closest dominating point p in p under the norm 
L., i.e. the dominating point of pi e p of p minimizing the 
L^-distance between pi and p. Hence, it is sufficient to 
represent only the dominating points instead of the 
prefixes themselves . 

As shown in FIG 4, the set p is conceptually 
represented as a 2 32 x 2 32 points bit matrix, where bit p is 
set if p e p. To reduce the space required for the 
representation, we actually represent p as a four level 
2 8 * B -ary tree. Each level is (again) conceptually 
represented as a 2 8 x 2 3 bits bit matrix where bit (s,d) is 
set if there is a dominating point in the sub-tree below. 
That is, at level 1 (the top level), bit (s,d) represents 
the presence or absence of a dominating point in the 
rectangle [ (2 z ^s, 2 24 *d) , (2 24+ (s+l) ,• 2 24 *(d+l) ) ] of U. 

The actual representation of a level is a 2- 
dimensional dense chunk or simply a 2d-chunk. How and when 
a level can be represented by a 1-dimensional dense chunk 
is discussed later. A 2d-chunk consists of 32 x 32 tiles, 
where each tile represents 8x8 bits. Since the points 
defining a tile are dominating points of prefixes, not all 
2 64 kinds of tiles are possible. In fact, we impose a 
restriction cr. the tiles so that only 677 different kinds 
are possible. 
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If there is a point in a tile T (a point in some of 
the sub-universes represented by one of the bits in the 
tile) having its closest dominating point in another tile 
T d then all points in T have their closest dominating 

5 points in T d . The definition of a dominating point is 
extended to a dominating tile. The tile T d is called a 
dominating tile of T, or alternatively, tile T is dominated 
by the tile T d , 

In order to fulfil the requirement of the previous 

10 definition the following lemma is needed. 

If P = [(s 0 ,d 0 ),( si,d x )] is a prefix satisfying s a - 
s 0 >l, then [(s 0 ,d 0 ),( s 0 +2 i / di)] and [ ( So+2 1 , d 0 ) , ( si,di)], 
wherein si-s 0 =2 1 for some none-negative integer i, are also 
prefixes. The lemma for the other dimension is symmetrical. 

15 By the lemma above, a prefix can be cut into 2 parts 

whenever required. Hence, given a set of prefixes P d with 
representatives in the tile T d we can repeatedly cut them 
until all prefixes has their endpoints in the same tile, in 
both dimensions, to fulfil the requirement above. This is 

20 called tile cutting and a crucial part of the construction 
of dense2d chunks . 

The different kinds of tiles are divided into seven 
classes shown in FIG 5-11. For each class the/a tile is 
shown as a bit matrix in (asterisks represents bits that 

25 can be either 0 or 1) . For each bit set (not *) and tile 
class there are also lines indicating the guaranteed 
boundaries cf the subset dominated by that bit (point) . 
Note that a set bit in a tile can typically dominate points 
in other tiles to the right and/or below. We also give the 

30 number of different kinds of tiles in the class and 

distinguish between natural and restricted tile classes. 
Finally, we describe how the tiles are represented/encoded 
in the dense2d chunk. 

A class (0, 0) tile is shown in FIG 5. No bit is set: 

35 natural, 1 kind, and always dominated by a tile T d from 
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class (1, 1), (1/ 2), (2, 1), (1, 3+), or (3+, 1). Finding 
the dominating point of a point in bit (s b ,d b ) in a class 
(0, 0) tile is exactly the same as finding the dominating 
point of the corresponding point in bit (s b ,d b ) of its 
dominating tile T d . Hence, a class (0, 0) tile can, and 
should, always be encoded exactly the same way as its 
dominating tile T d . 

A class (1, 1) tile is shown in FIG 6. One bit is 
set: natural, 1 kind, and possibly dominates class (0, 0) 
tiles to the right and/or below. Since all points within 
this tile has the same closest dominating point, we simply 
encode a reference to that point within the tile itself 

A class (1, 2) tile is shown in FIG 7. Two bits in 
the first row (D-dimension) are set: natural, 1 kind, and 
possibly dominates class (0, 0) tiles below. Can not 
dominate class (0, 0) tiles to the right. 

There are two closest dominating points of the points 
in this tile, one for the points in the left half, and one 
for the points in the right half. We encode references to 
both these dominating points as an array of length 2, and 
can then use the left/right half of the query point as 
indices . 

A class (2, 1) tile is shown in FIG 8. Two bits in 
the first: column (S-dimension) are set: natural, 1 kind, 
25 and possibly dominates class (0, 0) tiles to the right. Can 
not dominate class (0, .0) tiles, below. There are two 
closest dominating points of the points in this tile, one 
for the points in the top half, and one for the points in 
the bottom half. References to both these dominating points 
are encoded as an array of length 2, and can then use the 
top/bottom naif of the query point as indices. 

A class (1, 3 + ) tile is shown in FIG 9. Three or more 
bits in the first row are set: natural, 24 kinds, and 
possibly dominates class (0, 0) tiles below. Can not 
dcrrr-nate class (0, 0) tiles co the right. There may be many 
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dominating points of the points in this class of tiles. It 
is necessary to encode the kind of the tile since there are 
24 different kinds of tiles. Further, for each bit set in 
the first row, a pointer to the dominating point below (if 

5 there is only one) or to the next level chunk (if there 
several dominating points) are encoded. Finally/ a 
reference tc the first pointer is encoded (a base pointer) . 
In this way, the dominating point {or a reference to the 
next level chunk) of a query point (s,d) can be found by 

10 simply inspecting in which column the d is and together 
with the kind of the chunk perform a table lookup to 
retrieve a pointer offset x, and finally retrieve the 
pointer x pointers away from the .base pointer. Note that 
any next level chunk only needs to be one (D-) dimensional 

15 since all representatives in the tile -lies on the same S- 
co-ordinate . 

A class (3+, 1) tile is shown in FIG 10. Three or 
more bits in the first column are set: natural, 2 4 kinds, 
and possibly dominates class (0, 0) tiles to the right. Can 

20 not dominate class (0, 0) tiles below. There may be many 
dominating points of the points in this class of tiles. It 
is necessary to encode the kind of the tile since there are 
. 24 different kinds. Further, for each set bit in the first 
column, a pointer to the dominating "point below (if there 

25 is only one) or to the next level chunk (if there several 
dominating points) are encoded. Finally, a reference to the 
first pointer' is encoded (a base pointer). In this way, the 
dominating point (or a reference to the next level chunk) 
of a query point (s,d) can be found by simply inspecting in 

30 which row the s is and together with the kind of the chunk 
perform a table lookup to retrieve a pointer offset x, and 
finally retrieve the pointer x pointers away from the base 
pointer. Note that any next level chunk only. needs to be 
one (S-) dimensional since all representatives m the tile 

35 lies on the same D-co-ordinate . 
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A class (2 + , 2 + ) tile is shown in FIG 11. Two or more 
bits are set in both the first row and the first column: 
restricted, 625 kinds, can not dominate another tile, and 
" can not be dominated by another tile. There are typically 

5 many dominating points in this class of tiles. The encoding 
is performed exactly as for class (1, 3+) and (3 + , 1) 
tiles. However, a restriction is imposed to reduce the 
number of different kinds before performing the actual 
encoding. The first task is to impose a restriction similar 

10 to the tile restriction of Definition 8 on each bit. Then a 
pair of bit vectors of length 8, Sv and Dv, is computed 
wherein 

Si = 1, if there is a bit set in the ith row, and 
15 0, otherwise 

Di = 1, if there is a bit set in the ith column, and 
0, otherwise 

A new tile is finally created, by computing the 
20 product of Sv and Dv T using matrix multiplication, and 
encoded. 

As in class {1, 3+) and (3+, D tiles, one 
dimensional sub-levels may be provided also in this case. 
It is checked whether all representatives in a bit, 
25 containing more than one representative, is in the same row 
in U, which means that the S-dimension collapses, or on the 
same column in U, which means that the D-dimension 
collapses . 

A further. description of the data structures used in 
30 the firewall for representing NAT and HP entries. 

In both cases, the pair of IP addresses saddr and 
daddr, the pair of ports sport and dport, and the protocol 
proto of the processed packer are used as key in the 
lookup. The first step in the lookup is to compute a hash 
35 -value. This is accomplished using very simple "and fast 
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instructions such as bit shifts bit-wise logical operators, 
^ Using the hash value as index, a 16 bits pointer is then 
retrieved from a large array (the Hash table) . 

The pointer is either 0, which means that the lookup # 
5 failed (empty) or refers to the root of a Patricia tree, 
which is a very efficient data structure for representing 
small sets of keys. If the pointer refers to a Patricia 
tree, a key is built by concatenating the bit patterns of 
saddr r daddr, sport, dport f and proto. The key is then used 
10 when searching the Patricia tree as described in the next 
section. 

A Patricia Tree, is a binary tree that treats query 
keys as bit arrays, and uses a bit index in each internal 
node to direct the branching. Searching is accomplished by 

15 traversing the tree from the root to a leaf. When visiting 
an internal node with bit index i, bit i of the query key 
is inspected to determine whether to continue the search in 
the left (if the bit is 0) or right (if the bit is 1) sub- 
tree. The traversal stops when arriving at a leaf. To 

20 determine if the query key is present in the table or not, 
the query key is then compared to the key srored in that 
leaf. If the two keys are equal, the search is successful. 

FIG 12 illustrates an example of an unsuccessful 
search for the query key .001111 in a Patricia Tree 

25 containing six keys. Bits nc. 0, 2, and 3 are inspected 
during the traversal, which ends at the leaf with key 
011101. As the query and leaf keys are compared, a mismatch 
is detected in bit no. 1. 

With respect to the bit indices stored in the 

30 internal nodes, a Patricia Tree is heap ordered. Thafis, 
any internal node, except the root, has a bit index greater 
than, the bit index of its parent. It follows that all keys 
stored in a sub-tree rooted at a node with bit index i are 
identical up to, and including, bit i-1. 
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Insertion is accomplished by first performing an 
unsuccessful search, and recording the index i of the first 
mismatching bit in the comparison of the query and leaf 
key. Two new nodes are then created, a new internal node 
5 with index i and a leaf node for the query key. Depending 
on whether the i th bit of the query key is 0 or 1, the 
leaf is stored as the left or right sub-tree, respectively, 
of the internal node. By using the other sub-tree field as 
link field, -he internal node is then inserted directly 
10 above the node with smallest bit index larger than i in the 
path traversed from the root to the leaf. 

FIG 13 shows the Patricia Tree resulting from 
inserting the query key from the unsuccessful search of the 
previous example in FIG 12. A new internal node with bit 
15 index 1 is created, and inserted between the nodes with bit 
indices 0 and 2, in the path traversed from the root. 

The Parricia Hashing used for hole punching works 
exactly as described above -a simple Hash table lookup 
followed by a Patricia tree lookup. Most of the time, a 
20 leaf is reached directly, which means that it is not 

necessary zz build a bit array from the parameters - these 
are compared directly to corresponding fields in the 
structure containing/representing the Patricia . leaf . 

One lookup function hp_lookup (iaddr, xaddr, iport, 
25 xport, protc) is provided that are used both for I2X-HP and 
X2I-HP. The only difference between these are the order in 
which the parameters are given. For I2X-HP, the function 
call is hp_iookup(saddr, daddr , sport, dport, proto) and 
• for X2I-HP -he call is hp^lookup (daddr , saddr, dport, 
30 sport, proto) . 

o. 

The lookup function returns a reference to a 
structure containing the Patricia leaf key, i.e. iaddr, 
xaddr, iporz, xport, and proto, and a couple of other 
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fields representing the state of the connection, for 
example TCP sequence numbers. 

The Patricia Hashing for NAT is slightly more 
complicated than for HP. The reason is that three different 

5 addresses and ports, iaddr, naddr, xaddr, iport, nport, 
xport r are involved, as opposed to HP where only two 
addresses and ports are involved. This means that the. 
difference between I2X and X2I becomes a little more tricky 
than just swapping addresses and ports in the lookup. 

10 The problem is solved by letting the least 

significant bit of the hash value reflect if the lookup is 
I2X or X2I (this is essentially the same as using two hash 
tables) . The structure containing the Patricia leaf keys 
for a NAT connection is the same for I2X and X2I and it 

15 contains all three addresses and ports. 

There are two lookup functions, nat_i2x__lookup (saddr , 
daddr, sport, dport, proto) and nat_x2i_lookup (saddr , 
daddr f sport, dport, proto). Both functions uses the 
arguments to compute a hash value where the least 

20 significant bit is set to accordingly. If the resulting 
pointer refers to a Patricia node (internal node), the 
addresses, ports, and protocol are concatenated to create 
the bit array needed for traversing the Patricia tree. When 
the leaf structure is reached, the addresses, ports, and 

25 protocol are compared to the corresponding fields in the 
leaf. 

When a packet is subject to I2X-NAT: 

saddr (of the packet) is compared to iaddr (of the 
30 leaf structure) 

daddr is compared to xaddr 
sporz is compared to iport 
dporz is compared to xport 
proto is compared to proto 
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If all of these matches, the lookup is successful, 
and the source address and port, saddr and sport, of the 
packet are replaced by naddr and nport (of the leaf 
structure) , respectively, before the packet is forwarded. 

5 

When a packet is subject toX2I-NAT: 

saddr (of the packet) is compared to xaddr (of the 
leaf structure) 
10 daddr is compared to naddr 

sport is compared to xport 
dpcrt is compared to nport 
proto is compared to proto 

15 if all of these matches, the lookup is successful, 

and the destination address and port, daddr and dport, of 
the packet are replaced by iaddr and iport (of the leaf 
structure) , respectively, before the packet is sent to the 
next processing step. 

20 Updates of the HP and NAT data structures are 

performed by the EffNIX kernel' (previously NetBSD) running 
on the ES? (processor 1) but most of the lookups are 
performed by the forwarding kernel running on the AP 
(processor 2;. There are only one instance of the HP data 

25 structure and one instance of the NAT data structure. These 
resides in shared memory and are -accessed by the two 
processors simultaneously- This results in a very 
interesting synchronisation problem - oneawriter and one 
reader. The synchronisation is solved by letting the update 

30 routines invalidate the leafs structures and nodes before 
changing anything (writing) . The lookup routines checks 
that the accessed leafs and nodes are valid before and 
after they have been accessed, and also that they have not 
been changed during the access. If a race occurs and is 

35 detected (all dangerous race conditions are detected) the 



WO 00/02114 



28 



PCT/SE99/01202 



lookup fails and the packet is sent to the BSP and dealt 
with there (either a successful lookup followed by 
processing is performed, or the data structures are 
updated) . 

It should be apparent that the present invention 
provides a firewall apparatus and a method of controlling 
network data packet traffic between internal and external 
networks that fully satisfies the aims and advantages set 
forth above . 

Although the invention has been described in conjunc- 
tion with a specific embodiment thereof, this invention is 
susceptible of embodiments in different forms, with the 
understanding that the present disclosure is to be con- 
sidered as an exemplification of the principles of the in- 
vention and is not intended to limit the invention to the 
specific embodiment illustrated. 
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CLAIMS 

1. A firewall (3) for controlling network data packet 
■ traffic between internal and external networks (1,5,4), 

comprising filtering means for selecting from a total set 
5 of rules, in dependence of the contents in data fields of a 
data packet being transmitted between said networks a rule 
applicable to said data packet, in order to block said 
packet or to forwarded said packet through the firewall 
(3), characterized by 2-dimensional address 
10 lookup means (8) for a 2-dimensional lookup of said source 
and destination addresses of the packet in a set of address 
prefixes, each prefix having a subset of rules of the total 
set of rules, in order to find a prefix, via its 
representation, associated with said source and destination 
15 addresses, and rule matching means (10) for rule matching - 
on the basis of the contents of said data fields in order 
to find the rule applicable to said data packet. 

2. A firewall according to claim 1, 

20 characterized in that said 2-dimensional 

address lookup means (3} comprises means for finding the 
prefix associated with said source and destination 
addresses by determining the closest dominating point p in 
p under the norm L B/ .i.e. the dominating point of pi e p of 

25 p minimising the L^-distance between pi and p. 



3. A firewall according to claim 2, 
characterized in that 
30 the source and destination addresses are represented 

by a point ;s,d) e U, wherein U is a 2 dimensional address 
space represented by integer pairs (s,d) satisfying: 
0 < s < 2 5: , 3 < d < 2 32 , 

the prefixes P = { ?i, ? 2 , Pn) is a partition of the 
35 address spate U, and 
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each prefix Pi is a logical rectangle R in the 
address space U defined by [(s 0 ,d 0 ), ( s x , di) ], . where Si-s 0 = 
Si-2 is * k s = 2 is and di-d 0 = di-2 id * k d = 2 id for some non 
negative integers i s ,i d ,k s , an d k d , 



satisfying: is,d) e R if s 0 < s < s lf d 0 < d < di, wherein 
(s 0 ,d 0 ),( si,di)€ U, and the pair of points [(s 0 ,d 0 ),( Si,di)] 
uniquely defines said rectangle R. 

4. A firewall according to claim 2 or 3, 
characterized in that 

for each prefix P = [ (s 0 , d 0 ) , (si, d x ) ] € P, the point 
Po=(so/d 0 ) is a representative of P, and p = {pi, P2/ - - 

,,p n } = { (s^di) , (s 2 ,d 2 ) (s„,d„) } is the set of 
representatives of the prefixes in P, wherein given a point 

(s d ,d d ) € U, for each (s,d) e U, wherein s d > s and d d > d, 

(s,d) is dominated by (s d ,d d ). 

5, A firewall according to claim 3, 
characterized in that, given a pair of points 
(Si, di) , (s 2 , d:? e U, the distance between the points under 
the norm L x is given by: 



6. A firewall according " to any of the preceding 
claims, characterized by a fragment machine 

(11) comprising fragment collecting means for collecting 
packet fragments from a fragmented packet until a fragment 
.header of said packet is received, fragment header storing 
means for storing in an entry means information present in 
a fragment header field of the packet, fragment forwarding 
means for forwarding packet fragments provided with 

fragment header information starting with the fragment 



said logical rectangle R being. a subset of U 
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header, wherein each fragment is processed by the filtering 
means as a regular unfragmented packet. 

7 . A firewall according to any of the preceding 
claims, characterized by network address 
translation means (12,14) for translating, in dependence of 
the information in the prefix, internal source addresses to 
external source addresses of a packet transmitted out 
through the firewall (3), or external source addresses to 
internal source addresses of a packet transmitted in 
through the firewall (3) . 

8. A firewall according to any of the claims 1-6, 
characterized by network address translation 
means (12, 14) for translating, in dependence of the 
information in the prefix internal source addresses to 
external source addresses of a packet transmitted from the 
internal network (1) to the external network (4), or 
external source addresses to internal source addresses of a 
packet transmitted from the external network (4) to the 
internal network (1) . 

9. A firewall according to any of the preceding 
claims, characterized by hole punching means 

' (16,17) for determining, on the basis of the information in 
the prefix, if said packet is subject to a temporary 
exception from an external-to-internal blocking rule for a 
connection initiated from the internal network, wherein a 
return channel for packets transmitted from the external 
network (4) :o the internal network (1) is established 
through the firewall during the lifetime of the connection. 

10. A firewall (3) for controlling network data packet 
traffic between internal and external networks (1,5,4), 
comprising filtering means for selecting from a total set 



WO 00/02114 



32 



PCT/SE99/01202 



of rules, in dependence of the contents in data fields of a 
data packet being transmitted between said networks , a rule 
applicable to the data packet, in order to block said 
packet or to forwarded the packet through the firewall (3) , 
characterized by a fragment machine (11) com- 
prising fragment collecting means for collecting pa'cket 
fragments from a fragmented packet until a fragment header 
of said packet is received, fragment header storing means 
for storing in an entry means information present in a 
fragment header field of the packet, .fragment forwarding 
means for forwarding packet fragments provided with 
fragment header information starting with the fragment 
header, wherein each fragment is processed by the filtering 
means as a regular unfragmented packet. 

11. A method of controlling network data packet 
traffic between internal (1,5) and external networks (4) 
through a firewall (3), comprising the steps of, 

selecting from a total set of rules, in dependence 
of the contents in the data fields of a data packet being 
transmitted between said networks, a rule applicable to the 
data packet, 

applying said rule on said packet, and 

depending on the rule, blocking said packet or 
forwarding said packet through the firewall (3), 

characterized in that said filtering 
comprises the further steps of: 

performing a 2-dimensional lookup of the source and 
destination addresses of the packet in order to find a 
prefix, via its representation, associated with said source 
and destination addresses in a set of address prefixes, 
each prefix having a subset of rules of the total set of 
rules , 

and or. the basis of the contents of said data fields 
of the packet, performing a rule matching on the subset of 
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rules in order to find the rule applicable to the data 
packet . 

12. A method according to claim 11, 
characterized in that preceding the step of 
selecting a rule applicable to the data packet it comprises 
the further steps of: 

collecting packet fragments from a fragmented packet 
until a fragment header of said packet is received, 

storing in an entry means information present in a 
fragment header field of the packet, and 

forwarding packet fragments provided with fragment 
header information starting with the fragment header, 
wherein each fragment is processed by the filtering means 
as a regular unfragmented packet. 

13. A method according to claim 11 or 12, 
characterized in that preceding the step of 
performing a rule matching it comprises the further step 
of: 

in dependence of the information in the prefix, 
translating the external source address to an internal 
source address of a packet to be transmitted in through the 
firewall (3) . 

14. A method according to any of the preceding claims 
11-13, characterized in that preceding the 
step of performing a rule matching it comprises the further 
step of: 

depending on the information in the prefix, 
translating the external source address to an internal 
source address of a packet to be transmitted from the 
external network (4) to the internal network (1,5). 
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15. A method according to any of the preceding claims 
11-14, characterized by the further step of: 

depending on the-, information in the prefix 
translating the internal source address to an external 
source address of a packet to be transmitted out through 
the firewall (3) . 

16. A method according to any of the preceding claims 
11-15, characterized by the further step of: 

depending on the information in the prefix 
translating ' the internal source address to an external 
source address of a packet to be transmitted from the 
internal network (4) to the external network (1) . 

17. A method according to any of the preceding claims 
11-16, characterized in that preceding the 
step of performing a rule matching it comprises the further 
steps of: 

based on the information in the prefix, determining 
if said packet is subject to a temporary exception from an 
external -to- internal blocking rule for a connection 
initiated from the internal network (1), 

if so, establishing a return channel for packets 
transmitted from the external network .(4) to the internal 
network (1) through the firewall (3), having a duration 
corresponding to the lifetime of the connection. 

.18. A method of controlling network data packet 
traffic between internal and external networks (1,5,4) 
through a firewall (3), comprising the steps of, 

in deoendence of the contents in the data fields of a 
data packet being transmitted between said networks, 
selecting from a total set of rules a rule applicable to 
the data packet, 

applying said rule on said packet, 
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and depending on the rule, blocking said packet or 
forwarding said packet through the firewall (3) , 
characterized in that preceding the step -of 
selecting a rule applicable to the data packet it comprises 
5 the further steps of: 

collecting packet fragments from a fragmented packet 
until a fragment header of said packet is received, 

storing in an entry means information present in a 
fragment header field of the packet, and 
10 forwarding packet fragments provided with fragment 

header information starting with the fragment header, 
wherein each fragment is processed by the filtering means 
as a regular unfragmented packet. 

15 19. A method according to any of the preceding claims 

11-18, characterized in that the step of 
performing a 2-dimensional lookup of the source and 
destination addresses of the packet comprises the further 
step of: 

20 finding the closest dominating point p in p under the 

norm L^, i.e. the dominating point or p : e p of p, which 
minimises the L w -distance between pi and p. 

20. A method according to claim 19, 
.25 characterized in that 

the source and destination addresses are represented 
by a point (s,d) e U, wherein U is a 2 dimensional address 
space represented by integer pairs (s,d) satisfying: 
0 < s < 2 32 , C < d < 2 32 , 
30 the set of prefixes P = { Pi, P 2 , Pn } is a partition of 

the address space U, 

each prefix Pi is a logical rectangle R in the 
address space U defined by [(s 0 ,d 0 ), ( Si,di)], where si-s 0 = 
si-2 is * k s = 2 :s and di-d 0 = di~2 id * k d = 2 ld for some non 
?5 negative integers i s ,i d ,k s/ and k d , wherein the logical 
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rectangle R is a subset of U satisfying: (s;d) e R if s 0 < 
s < s x , d 0 < d < di, wherein (s 0 ,d 0 ),( Si,di)€ U, and the/ 
pair of points [(s 0f d 0 ), ( s if di)] uniquely defines said 
rectangle R, 

for each prefix P = [(s 0 ,d 0 ) , (s x , di)] e P, the point 
(so, do)' is a representative of P, and p - {pi, *P2/ • - -/Pn} 
= { (si/di) , (s 2 ,d 2 ) ,~, (s n ,d„) } are the set of representatives 
of the prefixes in P, wherein given a point (s d ,d d ) e U , 
for each (s,d) € U, wherein s d £ s and d d > d f (s,d) is 
dominated by (s d ,d d ), and 

given a pair of points (si,di), (s 2 ,d 2 ) € U, the 
distance between the points under the norm is given. by: 



lim* -»ooy|j, -s 2 \* + \d x -d 2 \ k = maxfls, -s 2 \ 9 \d x -</ 2 |) . 
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